A Comparative Study on Feature Extraction Technique for Isolated Word Speech Recognition
ثبت نشده
چکیده
Digital Speech Signal Processing is the process of converting one type of speech signal representation to another type of representation so as to uncover various mathematical or practical properties of the speech signal and do appropriate processing to support in solving both fundamental and deep troubles of interest. Digital Speech Processing chain has two different main model They are Speech Production Model/Generation Model which deals with acoustic waveform and Speech Perception Model/Recognition Model deals with spectral representation for recognition process. Digital Speech Processing used to achieve reliability, flexibility, accuracy, real implementations on low-cost digital speech processing chip, facility to integrate with multimedia and data, encryptability/security of the data and the data representations via suitable techniques. The overall process of production and recognition of speech is to convert the speech signal from the device or human, and to understand the message is speech chain. In other word, the process of converting the speech signals into acoustic Abstract: One of the common and easier techniques of feature extraction is Mel Frequency Cestrum Coefficient (MFCC) which allows the signals to extract the feature vector. It is used by Dynamic Feature Extraction and provide high performance rate when compared to p technique like LPC. But one of the major drawbacks in this technique is robustness. extraction technique is Relative Spectral feature coefficient and in both the log spectral and the distortions as an additive constant. The high the convolution noise introduced in the channel. The low frame spectral changes. Compared to MFCC feature extraction technique, RASTA filtering reduces the impact of the noise in signals and provide
منابع مشابه
Comparative Study of MFCC And LPC Algorithms for Gujrati Isolated Word Recognition
The study performs feature extraction for isolated word recognition using Mel-Frequency Cepstral Coefficient (MFCC) for Gujarati language. It explains feature extraction methods MFCC and Linear Predictive Coding (LPC) in brief. The paper compares the performances of MFCC and LPC features under Vector Quantization (VQ) method. The dataset comprising of males and females voices were trained and t...
متن کاملComparative Study on Feature Extraction Methods in Speech Recognition
Speech recognition is an analysis side of the main topics machine speech processing. The synthesis side may be called speech production. These two taken together allow computers to cooperate with spoken language. My study specializes in isolated word speech recognition in Gujarati language. Special message recognition, in humans, is 1000s of years old. On our planet it would be traced backed mi...
متن کاملFeature Extraction and Dimensionality Reduction using IPS for Isolated Tamil Words Speech Recognizer
Automatic Speech Recognition (ASR), is the process of converting a speech waveform into the text quite similar to the information being communicated by the speaker. This paper aims to construct a speech recognition system for Tamil language. Mel Frequency Cepstral Coefficients (MFCC) is a commonly used feature extraction technique for speech recognition which is computed by applying DCT to the ...
متن کاملThe research on Uighur speaker-dependent isolated word speech recognition
Wushour·silamu Caiqin Nuominghua College of information science and engineering Xinjiang University, Urumqi 830046 Abstract: A small vocabulary, isolated word speech recognition system in java has been realized. In this system we have done the extraction of feature parameter, the training of speech model parameter and the recognition of the recorded speech. MFCC is used as feature parameter, HM...
متن کاملA VLSI Architecture of Tone Classification Function-Based Isolated-Word Speech Recognition
Speech recognition in tonal languages such as Thai, Chinese, etc. classifies word meaning by using tone. Therefore tone classification function is extremely essential part for improving accuracy rate. This paper presents a novel VLSI architecture of tone classification function-based isolated word speech recognition. The architecture consists of two parts; feature extraction and tone classifica...
متن کامل